Inclusion AI released LLaDA2.0-Uni, a 16B MoE diffusion LLM that handles image understanding, 1024px image generation, image editing, and interleaved text-image generation in a single model.
Trained a WAI-Anima character LoRA on RunPod (AnimaLoraToolkit + sd-scripts) for $1.22, but at inference the side-ponytail direction won't shift with Danbooru tags or natural-language prompts, a directional bias from the Anima base. Includes the full verification record.
WAI-Illustrious SDXL v17 tested on M1 Max 64GB ComfyUI against v16 with the same seed. Hires fix now auto-corrects hands and feet, the four rating tags (general/sensitive/nsfw/explicit) still drive NSFW output, and v16-trained LoRAs mostly carry over — with one case where they don't.
Z-Image has its own pixel art LoRAs, but can they actually convert photos to pixel art via i2i? Tested Z-Image Turbo and the base model, and compared both with Illustrious, on M1 Max 64GB.
Tested WAI-Anima v1 on Windows + RTX 4060 Laptop GPU (8GB VRAM). Headless execution via ComfyUI API hit a tqdm OSError on startup, but launching ComfyUI normally generates a single image in 55 seconds. Includes the workaround and timing notes.
Tested WAI-Anima v1, Anima preview3-base, and WAI-Illustrious v160 side by side on M1 Max 64GB ComfyUI with the same seed and prompt. WAI-Anima inherits Anima's atmospheric lighting and natural running poses but still loses to WAI-Illustrious on tag control and character consistency. Includes an i2i pipeline test (denoise 0.5), ~275s generation times, and how the Anima derivative ecosystem (WAI-Anima, CottonAnima, Kirazuri, RDBT) expanded in two months.
Tested 5 approaches to pixel art conversion, including Qwen Image Edit, JS color reduction, and Illustrious i2i + LoRA. Illustrious i2i alone turned out to be the fastest and lightest solution.
Testing See-through for anime character PSD decomposition: 23 generated layers, front/back hair separation, hidden-area inpainting, and what LayerDiff + Marigold actually produced from a single illustration.
Luma AI's Uni-1 integrates image understanding and generation in one decoder-only autoregressive model. It does not use diffusion; instead, it tokenizes text and image patches in a shared vocabulary and generates them sequentially.
Why ComfyUI breaks on NVIDIA Blackwell (sm_120) GPUs with 'no kernel image is available for execution' errors, and a working setup using PyTorch Nightly, xformers removal, SageAttention, and NVFP4 quantization. Tested on RTX PRO 6000 Blackwell.
Using tori29umai’s LoRA to automatically split facial parts: results from batching 28 images, and a log of the limits hit when attempting finer hair separation.